Multi-Source Data Repairing: A Comprehensive Survey

نویسندگان

چکیده

In the era of Big Data, integrating information from multiple sources has proven valuable in various fields. To ensure a high-quality supply multi-source data, repairing different types errors data becomes critical. This paper categorizes into entity overlapping, attribute value conflicts, and inconsistencies. We first summarize existing methods for these then examine review study detection repair compound-type data. Finally, we indicate further research directions repair.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Relational Data Mining A Comprehensive Survey

Multi-Relational Data Mining or MRDM is a growing research area focuses on discovering hidden patterns and useful knowledge from relational databases. While the vast majority of data mining algorithms and techniques look for patterns in a flat single-table data representation, the sub-domain of MRDM looks for patterns that involve multiple tables (relations) from a relational database. This sub...

متن کامل

Fuzzy multi-criteria selection procedures in choosing data source

Technology assessment and selection has a substantial impact on organizations procedures in regards to technology transfer. Technological decisions are usually made by a group of experts, and whereby integrity of these viewpoints to a single decision can be quite complex. Today, operational databases and data warehouses exist to manage and organize data with specific features and henceforth, th...

متن کامل

A Comprehensive Survey of Data Processing Approaches

In Wireless Sensor Networks (WSNs), energy of the sensor nodes are the most concerning factor because each sensor is equipped with limited amount of energy which is used to perform lots of work. Sensors have capabilities of sensing, analyzing, processing and communication of the data. In WSNs, the most responsible factor behind energy consumption of the sensor nodes is Data Processing which mai...

متن کامل

A survey of multi-source domain adaptation

In many machine learning algorithms, a major assumption is that the training and the test samples are in the same feature space and have the same distribution. However, for many real applications this assumption does not hold. In this paper, we survey the problem where the training samples and the test samples are from different distributions. This problem can be referred as domain adaptation. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2023

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math11102314